SEQUENCE LISTING 



<110> Kordyum, Vitally A. 

Chernykh, Svitlana I. 
Slavchenko, Iryna Yu. 
Vozianov, Oleksandr 

<120> PHAGE-DEPENDENT SUPER PRODUCTION OF 
BIOLOGICALLY ACTIVE PROTEIN AND PEPTIDES 





<130> 


PHAGE. 00 6A 




<150> 


09/318, 288 




<151> 


1999-05-25 




<160> 


11 




. <170> 


FastSEQ for Windows Version 




<210> 


1 




<211> 


630 




<212> 


DNA 


c ; 


<213> 


Artificial Sequence 




<220> 






<223> 


This sequence was chemically 






upon the amino acid sequence 



m 



fibroblast growth factor (155. amino acids) using 
codons which are used in highly expressed proteins 
from E. coli. 



<221> CDS 
<222> (122) 



[590) 



<400> 1 

gcgtagagga tcgagatctc gatcccgcga aattaatacg actcactata ggggaattgt 60 

gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag gagatataca 120 

t atg get gaa ggg gaa ate acc acc ttt aca gcg tta acg gag aaa ttt 169 

Met Ala Glu Gly Glu He Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 
15 10 15 



aac ctt ccg ccc ggg aat tac aaa aaa ccc aag ctt ctt tac tgc agt 
Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 
20 ■ 25 30 



217 



aac gga gga cac ttc ctg cga att ctg cca gat ggc aca gta gat ggg 
Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 
35 40 45 



265 



act cgc gat cgc tec gac cag cac att cag ctg caa etc teg gee gaa 
Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 
50 55 60 



313 



age gtt gga gag gtc tat ate aag teg acg gag. act ggc eag tac ctt 
Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 



361 



27 



65 70 75 80 

gcc atg gac acc gat ggg ctt ctg tat ggc tea cag acg cct aac gaa 409 
Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 
85 90 95 

gaa tgc ttg ttt eta gaa aga eta gaa gaa aae cat tac aae aeg tae 457 
Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 
100 105 110 

ata teg aaa aaa cat gea gag aag aac tgg ttt gta ggc ctt aaa aaa 505 
lie Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 
115 120 125 

aat ggt tec tgt aag cgt gga cea egg act cac tat ggc caa aag get 553 
Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 
130 135 140 

ate ttg ttc ctg cca eta eca gtg age tec gac taa g gatccgaatt 600 
lie Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 

cgagetccgt egacaagett geggcegcac 630 

<210> 2 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ala Glu Gly Glu He Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 

1 5 10 15 

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 

20 25 30 

Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 

35. 40 45 

Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 

50 55 ' 60 ■ 

Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 
65 70 75 80 

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro, Asn Glu 

85 90 95 

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 

100 105 110 

He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 

115 120 125 

Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 

130 135 140 

He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 



<210> 3 
<211> 468 
<212> DNA 

<213> Homo sapiens 



28 



<400> 3 

atggctgaag gggaaatcac caccttcaca gccctgaccg agaagtttaa tctgcctcca 60 

gggaattaca agaagcccaa actcctctac tgtagcaacg ggggccactt cctgaggatc 120 

cttccggatg gcacagtgga tgggacaagg gacaggagcg accagcacat tcagctgcag 180 

ctcagtgcgg aaagcgtggg ggaggtgtat ataaagagta ccgagactgg ccagtacttg 240 

gccatggaca ccgacgggct tttatacggc tcacagacac caaatgagga atgtttgttc 300 

ctggaaaggc tggaggagaa ccattacaac acctatatat ccaagaagca tgcagagaag 360 

aattggtttg ttggcctcaa gaagaatggg agctgcaaac gcggtcctcg gactcactat 420 

ggccagaaag caatcttgtt tctccccctg ccagtctctt ctgattaa 468 



<210> 4 
<211> 630 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> This sequence is a chemically synthesized sequence 
encoding a 134 amino acid form of fibroblast 
growth factor with alterations for preferred codon 
usage in E. coii 



<221> CDS 

<222> (122) . . . (526) 



<400> 4 

gcgtagagga tcgagatctc gatcccgcga aattaatacg actcactata ggggaattgt 60 
gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag gagatataca 120 
t atg aat tac aaa aaa ccc aag ctt ctt tac tgc agt aac gga gga cac 169 
Met Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser Asn Gly Gly His 
1 5 10 15 



ttc ctg cga att ctg cca gat ggc aca gta gat ggg act cgc gat cgc 
Phe Leu Arg lie Leu Pro Asp Gly Thr Val Asp Gly Thr Arg Asp Arg 
20 25 30 



217 



tec gac cag cac att cag ctg caa etc teg gee gaa age gtt gga gag 265 
Ser Asp Gin His lie Gin Leu Gin Leu Ser Ala Glu Ser Val Gly Glu 
35 40 45 



gtc tat ate aag teg acg gag act ggc cag tac ctt gee atg gac ace 313 
Val Tyr. lie Lys Ser Thr Glu Thr Gly Gin Tyr Leu Ala Met Asp Thr 
50 55 60 

gat ggg ctt ctg tat ggc tea cag acg cct aac gaa gaa tge ttg ttt 361 
Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu Glu Cys Leu Phe 
65 70 75 80 

eta gaa aga eta gaa gaa aac cat tac aae acg tac ata teg aaa aaa 409 
Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr lie Ser Lys Lys 
85 90 95 



eat gea gag aag aac tgg ttt gta ggc ctt aaa aaa aat ggt tec tgt 
His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys Asn Gly Ser Cys 
100 105 110 



457 



aag cgt gga cca egg act cac tat ggc caa aag get ate ttg tte ctg 
Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala lie Leu Phe Leu 



505 



29 



115 



120 



125 



cca eta cca gtg age tec gac taaggateeg aattegaget ccgtcgacaa 556 
Pro Leu Pro Val Ser Ser Asp 
130 135 

gcttgcggec gcactcgagc accaccacea ceaecactga gatccggctg etaaeaaage 616 
ecgaaaggaa getg 630 

<210> 5 
<211> 135 
<212> PRT 

<213> Homo sapiens 



<220> 

<223> Translated protein sequence for the ehemically 
synthesized 134 amino aeid form of fibroblast 
growth factor 

<400> 5 



Met 


Asn 


Tyr 


Lys 


Lys 


Pro 


Lys 


Leu 


Leu 


Tyr 


Cys 


Ser 


Asn 


Gly 


Gly 


His 


1 








5 










10 










15 




Phe 


Leu 


Arg 


He 
20 


Leu 


Pro 


Asp 


Gly 


Thr 
25 


Val 


Asp 


Gly 


Thr 


Arg 
30 


Asp 


Arg 


Ser 


Asp 


Gin 


His 


He 


Gin 


Leu 


Gin 


Leu 


Ser 


Ala 


Glu 


Ser 


Val 


Gly Glu 






35 










40 










45 








Val 


Tyr 
50 


He 


Lys 


Ser 


Thr 


Glu 
55 


Thr 


Gly 


Gin 


Tyr 


Leu 
60 


Ala 


Met 


Asp 


Thr 


Asp 


Gly 


Leu 


Leu 


Tyr 


Gly 


Ser 


Gin 


Thr 


Pro 


Asn 


Glu 


Glu 


Cys 


Leu 


Phe 


65 










70 










75 










80 


Leu 


Glu 


Arg 


Leu 


Glu 
85 


Glu 


Asn 


His 


Tyr 


Asn 
90 


Thr 


Tyr 


He 


Ser 


Lys 
95 


Lys 


His 


Ala 


Glu 


Lys 
100 


Asn 


Trp 


Phe 


Val 


Gly 
105 


Leu 


Lys 


Lys 


Asn 


Gly 
110 


Ser 


Cys 


Lys 


Arg 


Gly 
115 


Pro 


Arg 


Thr 


His 


Tyr 
120 


Gly 


Gin 


Lys 


Ala 


He 
125 


Leu 


Phe 


Leu 


Pro 


Leu 
130 


Pro 


Val 


Ser 


Ser 


Asp 
135 





















<210> 6 
<211> 630 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence is a chemically synthesized sequence 
encoding a 14 0 amino acid form of fibroblast 
growth factor with alterations for preferred codon 
usage in E. coli 

<221> CDS 

<222> (122) . . . (544) 
<400> 6 

gegtagagga tegagatcte gatcecgcga aattaatacg aetcactata ggggaattgt 60 
gagcggataa caatteccct etagaaataa ttttgtttaa ctttaagaag gagatataca 120 



30 



t atg ttt aac ctt ccg ccc ggg aat tac aaa aaa ccc aag' ctt ctt tac 
Met Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr 
15 10 15 



tgc agt aac gga gga cac ttc ctg cga att ctg cca gat ggc aca gta 
Cys Ser Asn Gly Gly His Phe Leu Arg lie Leu Pro Asp Gly Thr Val 
20 25 30 

gat ggg act cgc gat cgc tec gac cag cac att cag ctg caa etc teg 
Asp Gly Thr Arg Asp Arg Ser Asp Gin His lie Gin Leu Gin Leu Ser 
35 40 45 

gcc gaa age gtt gga gag gtc tat ate aag teg acg gag act gge cag 
Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin 
50 55 60 

tac ctt gcc atg gac ace gat ggg ctt ctg tat ggc tea cag aeg cct 
Tyr Leu Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro 
65 70 75 80 

aac gaa gaa tgc ttg ttt eta gaa aga eta gaa gaa aac cat tac aac 
Asn Glu Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn 
85 90 95 

acg tac ata teg aaa aaa eat gea gag aag aac tgg ttt gta ggc ctt 
Thr Tyr He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu 
100 105 110 

aaa aaa aat ggt tee tgt aag egt gga cca egg act cac tat ggc caa 
Lys Lys Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin 
115 120 125 

aag get ate ttg ttc ctg cca eta cca gtg age tec gac taaggatccg 
Lys Ala He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
130 135 140 

aattcgaget ccgtcgacaa gcttgeggee gcactcgagc accaccacca ccaccactga 
gatccggetg etaaea 

<210> 7 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Translated protein sequence for the chemically 
synthesized 140 amino acid form of fibroblast 
growth factor 

<400> 7 

Met Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr 

15 10 15 

Cys Ser Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val 

20 25 -30 

Asp Gly Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser 

35 40 45 

Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin 



31 



50 55 60 



Tyr 


Leu 


Axa 


iMiec 


Asp 


T" V-i V- 

i nr 


Asp 


biy 


Leu 


Leu 


Tyr 


biy 


Ser 


lain 


T* Vi V 

i nr 


Pro 


65 










70 










75 










80 


Asn 


Glu 


Glu 


Cys 


Leu 

85 


Phe 


Leu 


Glu 


Arg 


Leu 
90 


Glu 


Glu 


Asn 


His 


Tyr 
95 


Asn 


Thr 


Tyr 


He 


Ser 
100 


Lys 


Lys 


His 


Ala 


Glu 
105 


Lys 


Asn 


Trp 


Phe 


Val 

110 


Gly 


Leu 


Lys 


Lys 


Asn 
115 


Gly 


Ser 


Cys 


Lys 


Arg 
120 


Gly 


Pro 


Arg 


Thr 


His 
125 


Tyr 


Gly 


Gin 


Lys 


Ala 
130 


He 


Leu 


Phe 


Leu 


Pro 
135 


Leu 


Pro 


Val 


Ser 


Ser 
140 


Asp 









<210> 8 
<211> 1822 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> TATA__signal 
<222> (102) . . . (107) 

.5 <221>.CDS 

^ <222> (193) . . . (202) 

<221> intron 

<222> (203) . . . (458) 

M <221> CDS 

m <222> (459) . . . (619) 

p <221> intron 

^ <222> (620) . . . (828) 

iZ <221> CDS 

:fj <222> (829) . . . (948) 

M <221> intron 

<222> (949) . . . (1041) 

<221> CDS 

<222> (1042) . . . (1206) 

<221> intron 

<222> (1207) . . . (1459) 

<221> CDS 

<222> (1460) . (1654 ) 

<223> Chemically synthesized sequence for Human Growth 
Hormone using codons preferred for expression in 
E. coli 

<400> 8 

ggagcttcta aattatccat tagcacaagc ccgtcagtgg ccccatgcat aaatgtacac 60 

agaaacaggt gggggcaaca gtgggagaga aggggccagg gtataaaaag ggcccacaag 120 

agaccggctc aaggatccca aggcccaact ccccgaacca ctcagggtcc tgtggacgct 180 



32 



cacctagctg ca atg get aca g gtaagcgccc ctaaaatccc tttgggcaca 
Met Ala Thr 
1 



232 



atgtgtcctg aggggagagg cagcgacctg tagatgggac gggggcacta accctcaggt 292 
ttggggcttc tgaatgagta tcgccatgta agcccagtat ggccaatctc agaaagctcc 352 
tggtccctgg agggatggag agagaaaaac aaacagctcc tggagcaggg agagtgctgg 412 
cctcttgctc tccggctccc tctgttgccc tctggtttct ccccag gc tec egg aeg 469 

Gly • Ser Arg Thr 
5 



tec ctg ete etg get ttt gge etg etc tgc ctg cec tgg ctt caa gag 
Ser Leu Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp Leu Gin Glu 
10 15 20 



517 



gge agt gee ttc cca ace att ece tta tec agg ctt ttt gac aac get 
Gly Ser Ala Phe Pro Thr lie Pro Leu Ser Arg Leu' Phe Asp Asn Ala 
25 30 35 



565 



atg etc ege gee eat cgt ctg cae eag etg gee ttt gac ace tae eag 
Met Leu Arg Ala His Arg Leu His Gin Leu Ala Phe Asp Thr Tyr Gin 
40 45 50 55 



613 



gag ttt gtaagetett ggggaatggg tgegeatcag gggtggcagg aaggggtgae 
Glu Phe 



669 



ttteeecege tgggaaataa gaggaggaga etaaggaget cagggttttt eeegaagcga 729 

aaatgcagge agatgagcac aegctgagtg aggttceeag aaaagtaaca atgggagctg 789 

gtcteeageg tagacettgg tgggcggtcc ttetectag gaa gaa gee tat ate 843 

Glu Glu Ala Tyr He 
60 



cca aag gaa eag aag tat tea ttc etg eag aac cec eag acc tee ete 
Pro Lys Glu Gin Lys Tyr Ser Phe Leu Gin Asn Pro Gin Thr Ser Leu 
65 70 75 



891 



tgt ttc tea gag tet att ecg aca cec tee aac agg gag gaa aca caa 
Cys Phe Ser Glu Ser He Pro Thr Pro Ser Asn Arg Glu Glu Thr Gin 
80 85 90 



939 



eag aaa tee gtgagtggat gecttgacce eaggcgggga tgggggagac 
Gin Lys Ser 
95 



988 



ctgtagteag ageccceggg cageaeagge eaatgcecgt cettcccetg eag aac 

Asn 



1044 



eta gag etg etc ege ate tec etg etg etc ate cag teg tgg etg gag 
Leu Glu Leu Leu Arg He Ser Leu Leu Leu He Gin Ser Trp Leu Glu 
100 105 110 



1092 



cec gtg cag ttc etc agg agt gte ttc gee aac age etg gtg tae gge 
Pro Val Gin Phe Leu Arg Ser Val Phe Ala Asn Ser Leu Val Tyr Gly 
115 120 125 130 



1140 



33 



gcc tct gac age aac gtc tat gac etc eta aag gac eta gag gaa gge 1188 
Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu Glu Glu Gly 
135 140 145 



ate caa aeg etg atg ggg gtgggggtgg egctaggggt ccceaatett 1236 
lie Gin Thr Leu Met Gly 
150 

ggageeeeae tgaetttgag agctgtgtta gagaaaeact getgeectet ttttagcagt 1296 
eeaggccctg acccaagaga aetcaeetta ttcttcattt eccctcgtga ateetetage 1356 
ctttctetac accctgaagg ggagggagga aaatgaatga atgagaaagg gagggageag 1416 
taeccaagcg ettggeetet eettetette ctteaetttg eag agg etg gaa gat 1471 

Arg Leu Glu Asp 
155 

ggc age ccc egg act ggg eag ate ttc aag eag aec tac age aag tte 1519 
Gly Ser Pro Arg Thr Gly Gin lie Phe Lys Gin Thr Tyr Ser Lys Phe 
160 165 ' 170 • 

gae aea aae tea eae aac gat gac gea eta etc aag aac tac ggg etg 1567 
Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu 
175 180 185 



etc tac tgc tte agg aag gae atg gac aag gtc gag aea tte etg ege 1615 

Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg 

190 195 200 

ate gtg eag tgc ege tct gtg gag ggc age tgt gge tte tagctgcceg 1664 

lie Val Gin Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 
205 210 215 

ggtggcatcc ctgtgaeeee teeceagtgc ctctcetggc cttggaagtt gecactccag 1724 

tgcccaccag ccttgtccta ataaaattaa gttgeatcat tttgtetgac taggtgtcet 1784 
ctataatatt atggggtgga ggggggtggt ttggagea 1822 



<210> 9 
<211> 217 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu 

15 10 15 

Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala Phe Pro Thr lie Pro Leu 

20 25 30 

Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg Leu His Gin 

35 40 45 

Leu Ala Phe Asp Thr Tyr Gin Glu Phe Glu Glu Ala Tyr lie Pro Lys 

50 55 60 

Glu Gin Lys Tyr Ser Phe Leu Gin Asn Pro Gin Thr Ser Leu Cys Phe 
65 70 75 80 

Ser Glu Ser lie Pro Thr Pro Ser Asn Arg -Glu Glu Thr Gin Gin Lys 

85 90 95 

Ser Asn Leu Glu Leu Leu Arg lie Ser Leu Leu Leu lie Gin Ser Trp 

100 105 110 

Leu Glu Pro Val Gin Phe Leu Arg Ser Val Phe Ala Asn Ser Leu Val 
115 120 125 



34 



Tyr 




C J=i v- 


Asp 


Ser 


Asn 


va ± 


T \ 7 V 

iyr 




Leu 


Leu 


Lys 


7\ o 


Leu 


oX U 




1 T n 
1 








1 "3 R 










X 4 U 












ri \T T 1 o 
J, y X a.e 




Th V 

1 niT 


Leu 


lyje u 


ox y 


/ix y 


Leu 


oXU 


7\ o 
riSp 


oxy 


Q o -t^ 
OCX 


P "T 
IT X (J 


rtX y 




















X 00 










1 ^^O 
X DU 


1 nx 


o J- y o ± 1 i 


T 1 pi 
X J- c 


It i i t; 


Lys 


f^l n 
OXIi 


Th T 


T \/"r 

iyr 


"T 

lJ C X 


T .\7Q 

Xjy o 


IT 1 iC 




Th T 




o c; i. 








165 










170 










175 




His 


Asn Asp 


Asp 


Ala 


Leu 


Leu 


Lys 


Asn 


Tyr 


Gly 


Leu 


Leu 


Tyr 


Cys 


Phe 






180 










185 










190 






Arg 


Lys Asp 


Met 


Asp 


Lys 


Val 


Glu 


Thr 


Phe 


Leu 


Arg 


He 


Val 


Gin 


Cys 




195 










200 










205 








Arg 


Ser Val 


Glu 


Gly 


Ser 


Cys 


Gly 


Phe 

















210 215 



<210> 10 
<211> 990 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chemically synthesized sequence for human 
interferon alpha- 2b 



<221> promoter 
<222> (231) . . . (249) 



<221> CDS 
<222> (320) 



. (784) 



O 



<400> 10 

gggcgctgac 

tgttgctcag 

tgattcattc 

gagcacgatc 

tcactatagg 

ttaagaagga 



ttccgcgttt ccagacttta 
gtcgcagacg ttttgcagca 
tgctaaccag taaggcaacc 
atgcgcaccc gtggggccgc 
ggaattgtga gcggataaca 
gatatacat atg get gaa 
Met Ala Glu 
1 



cgaaacacgg aaaccgaaga ccattcatgt 60 
gcagtcgctt cacgttcgct cgcgtatcgg 120 
ccgccagcct agccgggtcc tcaacgacag 180 
cagatctcga tcccgcgaaa ttaatacgac 240 
attcccctct agaaataatt ttgtttaact 300 
ggg gaa ate acc acc ttt aca gcg 352 
Gly Glu He Thr Thr Phe Thr Ala 
5 10 



tta acg gag aaa ttt aac ctt ccg ccc ggg aat tac aaa aaa ccc aag 
Leu Thr Glu Lys Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys 
15 20 25 



400 



ctt ctt tac tgc agt aac gga gga cac ttc ctg cga att ctg cca gat 
Leu Leu Tyr Cys Ser Asn Gly Gly His Phe Leu Arg He Leu Pro Asp 
30 35 40 



448 



ggc aca gta gat ggg act cgc gat cgc tec gac cag cac att cag ctg 
Gly Thr Val Asp Gly Thr Arg Asp Arg Ser Asp Gin His He Gin Leu 
45 50 55 



496 



caa etc teg gee gaa age gtt gga gag gtc tat ate aag teg acg gag 
Gin Leu Ser Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu 
60 65 70 75 



544 



act ggc cag tac ctt gee atg gac acc gat ggg ctt ctg tat ggc tea 
Thr Gly Gin Tyr Leu Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser 



592 



35 



Era 



80 85 90 

cag acg cct aac gaa gaa tgc ttg ttt eta gaa aga eta gaa gaa aae 640 
Gin Thr Pro Asn Glu Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn 
95 100 105 

cat tac aac acg tac ata teg aaa aaa cat gca gag aag aac tgg ttt 688 
His Tyr Asn Thr Tyr lie Ser Lys Lys His Ala Glu Lys Asn Trp Phe 
110 115 120 

gta ggc ctt aaa aaa aat ggt tec tgt aag egt gga cea egg act cac 736 
Val Gly Leu Lys Lys Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His 
125 130 135 

tat ggc caa aag get ate ttg ttc etg cea eta cea gtg age tee gae 784 
Tyr Gly Gin Lys Ala lie Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
140 145 150 155 

taaggatccg aattcgaget ccgtcgaeaa gettgeggee gcaetegage ac.eaecaeca 844 

eeaccactga gatccggctg ctaacaaage ecgaaaggaa getgagttgg etgetgceae 904 

egctgagcaa taactagcat aaccccttgg ggeetetaaa egggtcttga ggggtttttt 964 

getgaaagga ggaaetatat ceggat 990 

<210> 11 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Translated protein sequence for the chemically 
synthesized human interferon alpha-2b 

<400> 11 

Met Ala Glu Gly Glu He Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 

15 10 15 

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 

20 25 30 

Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 

35 40 45 

Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 

50 55 , 60 

Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 
65 70 75 80 

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 

85 90 95 

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 

100 105 110 

He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 

115 120 125 

Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 

130 135 140 

He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 
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